KMID : 1022420160080030031

Phonetics and Speech Sciences
2016, Vol. 8, No. 3, pp. 31-38

Development of articulatory estimation model using deep neural network

You Hee-Jo

Yang Hyung-Won
Kang Jae-Koo
Cho Young-Sun
Hwang Sung-Hah
Hong Yeon-Jung
Cho Ye-Jin
Kim Seo-Hyun
Nam Ho-Sung

Abstract

Speech inversion (acoustic-to-articulatory mapping) is an important but non-trivial problem because the mapping is highly non-linear and non-unique. This study compared the performance of a Deep Neural Network (DNN) with that of a traditional Artificial Neural Network (ANN) on this problem. The Wisconsin X-ray Microbeam Database was used, with the acoustic signal as the model input and the articulatory pellet information as the model output. The performance of the ANN deteriorated as the number of hidden layers increased. In contrast, the DNN showed lower and more stable RMS error even with up to 10 hidden layers, suggesting that the DNN learns the acoustic-to-articulatory inversion mapping more efficiently than the ANN.
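
The abstract compares the models by RMS, which presumably means the root-mean-square error between predicted and measured pellet positions. The following is a minimal sketch of that metric under this assumption. The function name `rms_error`, the frame layout (one list of pellet coordinates per frame), and the toy values are hypothetical and are not taken from the paper.

```python
import math


def rms_error(predicted, reference):
    """Root-mean-square error over all frames and all pellet coordinates.

    Both arguments are lists of frames; each frame is a list of
    coordinate values (a hypothetical layout, not the paper's).
    """
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference need the same number of frames")

    squared_sum = 0.0
    count = 0
    for pred_frame, ref_frame in zip(predicted, reference):
        if len(pred_frame) != len(ref_frame):
            raise ValueError("each frame pair needs the same number of coordinates")
        for p, r in zip(pred_frame, ref_frame):
            squared_sum += (p - r) ** 2
            count += 1

    if count == 0:
        raise ValueError("no coordinates to compare")
    return math.sqrt(squared_sum / count)


# Toy data (made up): 2 frames with 2 coordinates each.
reference = [[0.0, 0.0], [0.0, 0.0]]
predicted = [[3.0, 4.0], [3.0, 4.0]]
error = rms_error(predicted, reference)
```

A lower value means the predicted articulatory trajectory is closer to the measured one. Whether the study averaged over pellets, speakers, or utterances in this way is not stated in the abstract.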

Keywords

the Wisconsin X-ray Microbeam Database, speech inversion, artificial neural network, deep neural network
